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* NOTICES * 

JPO and NCIPI are not responsible for any 
damages caused by the use of this translation. 

LThis document has been translated by computer. So the translation may not reflect the original 
precisely. 

2.**** shows the word which can not be translated. 
3.1n the drawings, any words are not translated. 

CLAIMS 



[Claim(s)] 

[Claim 1] An input means to be equipment which processes a sound signal and a video signal 
and to input a sound signal, A speech recognition means to have the speech recognition mode in 
which the predetermined sound signal in the voice register mode which is made to correspond to 
a predetermined character and registers said input voice, and said input voice is recognized, and 
to recognize said inputted sound signal, A generating means to generate the character signal 
corresponding to said recognized sound signal, The signal processor characterized by having a 
display means to display the character corresponding to the character signal generated by said 
generating means, and a record means to record the character signal generated by said 
generating means with a video signal. 

[Claim 2] Said speech recognition means is a signal processor according to claim 1 characterized 
by recognizing the voice registered beforehand from the inside of said input voice. 
[Claim 3] Said speech recognition means is a signal processor claim 1 characterized by having 
the register mode which is made to correspond to a character and registers said input voice, or 
given in two. 

[Claim 4] Said speech recognition means is a signal processor according to claim 3 characterized 
by matching and memorizing the control information which has memory and starts said character 
signal to said memory, and said input voice. 

[Claim 5] It is a signal processor claim 3 characterized by matching with said character and 
registering said input voice while said speech recognition means indicates by the character with 
said display means in said register mode, or given in four. 

[Claim 6] It is the signal processor according to claim 5 characterized by matching with said two 
or more characters, and registering said input voice while said speech recognition means displays 
said two or more characters with said display means in said register mode. 

[Claim 7] It is the signal processor according to claim 6 which is equipped with a selection means 
to choose two or more characters displayed by said display means at the time of said register 
mode, and is characterized by said speech recognition means registering said input voice 
inputted to the character chosen by said selection means. 

[Claim 8] Said speech recognition means is a signal processor according to claim 3 to 7 
characterized by judging whether said input sound signal is suitable for registration in said 
register mode, and registering said input voice according to the judgment result concerned. 
[Claim 9] It is the signal processor according to claim 8 characterized by controlling said 
character generating means and said display means in order to perform a predetermined display 
while forbidding registration of the input sound signal concerned, when it judges with said input 
sound signal of said speech recognition means being unsuitable to registration. 
[Claim 10] Said speech recognition means is a signal processor according to claim 1 to 9 
characterized by outputting the control information which starts said character based on said 
recognized sound signal to said generating means. 

[Claim 11] Said speech recognition means is a signal processor according to claim 10 which has 
the memory which memorizes said control information, reads said control information from said 
memory based on said recognized sound signal, and is outputted to said generating means. 
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[Claim 12] It is the signal processor according to claim 1 1 characterized by matching said 
memory further and memorizing said control information and speech information, for said speech 
recognition means reading the control information corresponding to said recognized input sound 
signal from said memory while recognizing said input sound signal using said speech information, 
and outputting to said generating means. 

[Claim 13] Said speech recognition means is a signal processor according to claim 12 
characterized by detecting the degree of correlation with the speech information memorized by 
said memory and said input sound signal, and recognizing said input sound signal based on the 
detection result concerned. 

[Claim 14] Said speech recognition means is a signal processor according to claim 13 
characterized by controlling said character generating means and said display means in order to 
perform a predetermined display while forbidding outputting said control information from said 
memory to said generating means, when the degree of said correlation is low. 
[Claim 15] Said generating means is a signal processor according to claim 10 to 14 characterized 
by reading the character data according to the control information which was equipped with a 
storage means to memorize the character data corresponding to two or more characters, 
respectively, and was outputted from said speech recognition means from said storage means. 
[Claim 16] It is the signal processor according to claim 1 to 15 which picturizes a photographic 
subject, is equipped with an image pick-up means to output a video signal, and is characterized 
by carrying out multiplex [ of said display means ] to the image corresponding to the video signal 
outputted from said image pick-up means, and displaying the character corresponding to the 
character signal from said generating means on it. 

[Claim 17] Said speech recognition means is a signal processor according to claim 1 
characterized by having a manual setting-out means to set up said voice register mode and said 
speech recognition mode. 

[Claim 18] Claim 1 characterized by having the voice input switch which turns into a trigger 
switch for beginning registration actuation and recognition actuation in said voice register mode 
and said speech recognition mode, or a signal processor given in 1 7. 

[Claim 19] Said generating means is a signal processor according to claim 1 to 18 characterized 
by suspending generating of said character signal after predetermined time amount after starting 
generating of said character signal. 

[Claim 20] A regenerative apparatus equipped with a playback means to be equipment which 
reproduces a sound signal and a video signal, and to reproduce a sound signal, a speech 
recognition means to recognize said reproduced sound signal, a generating means to generate 
the character signal corresponding to said recognized sound signal, and a display means display 
the character corresponding to the character signal generated by said generating means. 
[Claim 21] Said speech recognition means is a regenerative apparatus according to claim 20 
characterized by having the voice register mode which is made to correspond to a 
predetermined character and registers input voice, and the speech recognition mode in which the 
predetermined sound signal in said playback voice is recognized. 
[Claim 22] Said speech recognition means is a signal processor according to claim 21 
characterized by having a manual setting-out means to set up said voice register mode and said 
speech recognition mode. 

[Claim 23] Claim 21 characterized by having the voice input switch which turns into a trigger 
switch for beginning recognition actuation in said voice register mode, or a regenerative 
apparatus given in 22. 

[Claim 24] Said speech recognition means is a regenerative apparatus according to claim 20 to - 
23 characterized by judging whether playback sound signal level is more than predetermined 
level, and performing recognition actuation according to the judgment result concerned. 
[Claim 25] Said speech recognition means is a regenerative apparatus according to claim 24 
characterized by starting recognition actuation when it judges with said playback voice level 
being more than predetermined level. 

[Claim 26] Input a sound signal, make it correspond to a character, and said input sound signal is 
registered. While displaying the character corresponding to the character signal which has 
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recognized said inputted sound signal based on said registered sound signal, generated the 
character signal corresponding to said recognized sound signal, and was generated by said 
generating means The signal-processing approach characterized by recording the character 
signal generated by said generating means with a video signal. 

[Claim 27] The playback approach characterized by displaying the character corresponding to 
the character signal which reproduced the sound signal, has recognized said reproduced sound 
signal, generated the character signal corresponding to said recognized sound signal, and was 
generated by said generating means. 



[Translation done.] 
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* NOTICES * 

JPO and NCIPI are not responsible for any 
damages caused by the use of this translation. 

1This document has been translated by computer. So the translation may not reflect the original 
precisely. 

2.**** shows the word which can not be translated. 
3.1n the drawings, any words are not translated. 

DETAILED DESCRIPTION 

[Detailed Description of the Invention] 
[0001] 

[Field of the Invention] Especially this invention relates to the equipment and the approach 
accompanied by a character display about a signal processor and an approach, a regenerative 
apparatus, an approach, and image pick-up equipment. 
[0002] 

[Description of the Prior Art] Video camera one apparatus VTR is known as equipment which 
processes conventionally the video signal photoed with the video camera. And in video camera 
one apparatus VTR, it had the character display circuit and there was a thing equipped with the 
function which compounds and displays a character title on the photoed video signal. 
[0003] In this kind of equipment, when a character title was displayed on a screen, the 
photography person was performing actuation by the switch. 
[0004] 

[Problem(s) to be Solved by the Invention] However, not only actuation is complicated, but by 
such approach, the display timing of a character title is not in agreement with a motion of a 
screen for actuation by hand control. Therefore, it was difficult to display a character title with 
sufficient timing on a screen. It was much more difficult especially to display two or more 
character titles according to the content of a screen. 

[0005] This invention aims at solving the problem like the above-mentioned. 

[0006] The object of further others of this invention is in the place which enables two or more 

character displays easily, and enables the display of a character title on a screen to more 

effective timing. 

[0007] 

[Means for Solving the Problem] In the signal processor according [ on the bottom of such an 
object, and ] to invention given in this claim 1 It has the speech recognition mode in which the 
predetermined sound signal in an input means to input a sound signal, the voice register mode 
which is made to correspond to a predetermined character and registers said input voice, and 
said input voice is recognized. It considered as the configuration equipped with a speech 
recognition means to recognize said inputted sound signal, a display means to display the 
character signal corresponding to said recognized sound signal on a screen, and a record means 
to record said displayed character signal with a video signal. 
[0008] 

[Embodiment of the Invention] Hereafter, the operation gestalt of this invention is explained to a 
detail using a drawing. 

[0009] Drawin g 1 is the block diagram showing the configuration of the camera one apparatus 
digital VTR with which this invention is applied as first operation gestalt. 

[0010] First, the actuation at the time of the usual record playback is explained. In drawing, the 
lens section 101 leads photographic subject light to the image pick-up section 102 including a 
lens, a zoom device, an iris, etc. The image pick-up section 102 picturizes a photographic subject 
using CCD, changes the obtained photographic subject light into an electrical signal, and outputs 
it to the video-signal processing circuit 103. 
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[001 1] To the signal outputted from the image pick-up section 102, a clamp, gamma amendment, 
a white balance, etc. process common knowledge for making the picturized signal into the video 
signal based on a television signal, and the video-signal processing circuit 103 outputs them to 
the record regenerative circuit 106 and a switch 1 15 through an adder 1 14. Here, like the after- 
mentioned, the character signal outputted from the character signal creation circuit 1 13 is added 
to the video signal outputted from the video-signal processing circuit 103 in the adder 114, and 
an adder 1 14 outputs it to the record regenerative circuit 106 and a switch 1 15. 
[0012] The record regenerative circuit 106 performs processing required in order to compress 
the amount of information and to record [-ization / addition of a synchronization, ID, etc., / error 
correcting code] it to the compressed video signal by encoding using well-known variable length 
coding, such as A/D conversion, DCT, and Huffman coding, to the video signal outputted from 
the video-signal processing circuit 103, and changes it into the gestalt suitable for record. 
Furthermore, the video signal which performed such processing is recorded on a magnetic tape. 
It connects with r side edge child at the time of video-signal record, and a switch 1 15 outputs 
the video signal outputted from the video-signal processing circuit 103 to a monitor 107. 
[0013] Thus, the recorded video signal is outputted to a switch 1 15, after the record 
regenerative circuit 106 is reproduced and an error correction, variable-length decode, Reverse 
DCT, etc. are processed. It connects with p side edge child at the time of video-signal playback, 
and a switch 1 15 outputs the regenerative signal from the record regenerative circuit 106 to a 
monitor 107. 

[0014] On the other hand, a microphone 104 collects voice and outputs the obtained voice to 
the sound signal processing circuit 105. To the sound signal from a microphone, level adjustment, 
noise reduction, etc. process common knowledge, and output the sound signal processing circuit 
105 to the record regenerative circuit 106, a switch 1 16, and the speech recognition circuit 111. 
To the sound signal from the sound signal processing circuit 105, A/D conversion, shuffling, an 
error correction, etc. process common knowledge, and record the record regenerative circuit 1 06 
on a magnetic tape. It connects with r side edge child at the time of sound signal record, and a 
switch 116 outputs the sound signal outputted from the sound signal processing circuit 105 to a 
loudspeaker 108. About the speech recognition circuit 111, it mentions later. 
[0015] Thus, the recorded sound signal is outputted to a switch 1 16, after the record 
regenerative circuit 106 is reproduced and an error correction, a DESHAFU ring, etc. are 
processed. It connects with p side edge child at the time of sound signal playback, and a switch 
116 outputs the regenerative signal from the record regenerative circuit 106 to a loudspeaker 
108. 

[0016] Next, actuation of the speech recognition circuit 1 1 1 in this gestalt is explained. Generally 
there is an object for unspecified speakers which does not limit the object for specified speakers 
and speaker who limit a speaker, but is recognized also with whose voice in a voice recognition 
unit. The object for specified speakers makes the vocabulary to recognize utter beforehand, 
registers it, and recognizes based on this registered data. Since a recognition system can be 
assigned to the specific speaker who uses it according to the equipment for these specified 
speakers, while the load of a system becomes light, a high recognition rate can be expected, and 
it is hard to be dependent also on language. 

[0017] On the other hand, since the object for unspecified speakers has unnecessary 
registration actuation, there is simple nature which a speaker is not chosen [ nature ] but 
operates speech recognition immediately, but in order to raise recognition precision, an 
arithmetic unit and a system with large-scale memory are needed. Since that a user is limited to 
about 1 individual in the case of the camcorder/movie like this gestalt, small, and low cost are 
absolutely made into conditions, it is a specified speaker and it can be said that the voice 
recognition unit for a specific vocabulary is further suitable. 

[0018] In this gestalt, the voice recognition system corresponding to a specified speaker is used 
from such a background. For this reason, the user was made to utter the vocabulary to 
recognize beforehand and it has the voice register mode which registers that voice data as a 
reference voice pattern, and the speech recognition mode in which matching with input voice and 
a reference voice pattern is performed. 



http://www4.ipdl.ncipi.gojp/cgi-bin/tran.web_cgi_ejje 



2006/01/06 



JP.2000-101919.A [DETAILED DESCRIPTION] 



3/6 ^— v 



[0019] In drawing 1 , a switch 117 performs a change-over in voice mode, and the switch 117 is 
selectable in the position c which turns off a voice input function, voice register mode a, and 
three positions in the speech recognition mode b. A switch 1 18 is a voice input switch used as 
the trigger switch at the time of inputting registration and the voice which a user utters at the 
time of recognition mode. 

[0020] Next, a character signal is generated by the speech recognition result, and the case 
where it displays on a monitor 107 is described. The speech recognition circuit 1 1 1 where 
speech recognition processing is performed to the sound signal inputted into the speech 
recognition circuit 1 1 1 through the sound signal processing circuit 105 from the microphone 104 
in the speech recognition circuit 1 1 1 like the after-mentioned outputs the speech recognition 
result to a control section 109. The speech recognition circuit 1 1 1 operates here, only when set 
as voice register mode or speech recognition mode by the switch 117. 
[0021] A control section 109 reads the character data according to the speech recognition 
result from the speech recognition circuit 1 1 1 from the character data memory 1 1 2, and outputs 
it to the character signal creation circuit 1 1 3. The character signal creation circuit 1 1 3 changes 
the character data from a control section 109 into a video signal, and outputs it to an adder 1 14. 
An adder 114 adds the video signal from the video-signal processing circuit 103, and the 
character signal from the character signal creation circuit 1 1 3, and outputs them to the record 
regenerative circuit 106 and a switch 117. The record regenerative circuit 106 and the switch 
1 17 are the same as that of the above-mentioned actuation. 

[0022] Here, a control section 109 controls the record regenerative circuit 106, a switch 117, 
and switch 1 18 grade according to actuation of a control unit 1 10 of having various actuation 
keys, such as record, playback, a halt, and mode setting, including a microcomputer. 
[0023] Next, concrete actuation of the speech recognition circuit 11 1 is explained. The sound 
signal inputted from the sound signal processing circuit 105 is amplified by predetermined gain in 
pre amplifier 111a, and is outputted to A/D-converter 111b. A/D-converter 111b carries out 
A/D conversion of the sound signal inputted from pre amplifier 111b, and outputs it to 
microprocessor 111c. Here, microprocessor 1 1 1c is performing the automatic gain control which 
requires feedback control for pre amplifier 111a so that the sound volume suitable for speech 
recognition may be inputted. 

[0024] To the sound signal from A/D-converter1 11b, like the after-mentioned, microprocessor 
111c performs speech recognition processing, and outputs the speech recognition actuation 
situation and speech recognition result to a control section 1 09 with a data bus. Moreover, 
microprocessor 111c operates according to the voice mode set up with a switch 1 1 7, and 
mentions later about the actuation. Furthermore, microprocessor 111c outputs the voice pattern 
obtained by speech recognition processing at the time of voice register mode to memory 1 1 1d. 
[0025] Drawing 2 is a flow chart which shows the flow of the speech recognition processing 
which microprocessor 111c performs. Here, the standard pattern set as the object of recognition 
is created, and the pattern-matching method which performs word speech recognition is used by 
judging whenever [ input voice and standard pattern's coincidence ]. 

[0026] In drawing 2 , it distinguishes whether speech recognition mode is set up (S201), and 
when it is in speech recognition mode, speech recognition mode processing is performed. (S202) 
It distinguishes whether on the other hand, when it is not in speech recognition mode in S201, 
voice register mode is set up (S203), and when voice register mode is set up, voice register 
mode processing is performed. (S204) 

[0027] Next, concrete actuation in each mode is explained. Drawing 3 is a flow chart which 
shows actuation of voice register mode processing of drawing 2. If the switch 1 1 7 is set to a 
terminal side, voice register mode processing is started, and a control signal will be outputted to 
a control section 109 in order to display the list screen of a character title as shown in drawing 5 
(a)inS301. 

[0028] Here, setting out of a character title is performed in the character setting-out mode set 
up by the control unit 110. That is, a control section 109 outputs a control signal to the 
character signal generation circuit 1 13 so that it may perform the display shown in the monitor 
107 at drawing 4 , if character setting-out mode is directed by the control unit 1 10. At this time, 
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image pick-up actuation by the image pick-up section 102 is not performed, but the background 
°f drawjng 4 serves as a predetermined image. 

[0029] Like drawing 4 , a user is operating a control unit 1 10, and choosing and determining a 
desired alphabetic character in 50 sound displays, and inputs a character title. A control section 
109 attaches a title number for every title to the character title inputted here, and memorizes it 
in the character data memory 112. Thus, the item list of the character titles memorized by the 
character data memory 1 12 is displayed as a character title list screen of drawing 5 (a) in S301. 
In addition, the character title list screen of drawing 5 (a) can display the list key of a control 
unit 1 10 also by carrying out a depression. 

[0030] A display of a character title list screen chooses a character title in S302. Drawing 5 (b) 
and drawin g 5 (c) can show an example of selection of a character title, and the arrow-head 
cursor on a screen can choose the item of taking up and down and a request according to the 
arrow key of a control unit 1 10. 

[0031] Thus, selection of which character title performs the input and registration of the voice 
made to correspond to a character title next If microprocessor 111c detects whether the switch 
1 18 is turned on in S303 and the switch 1 18 is turned on in it, the title number of the character 
title chosen from the control section 109 will be read. 

[0032] Next, in S304, detection and the voice analysis of a user's input voice are performed. A 
user utters a title name here that the selected character title and input voice should be made to 
correspond, and it should register. For example, if the "nice shot" of the title number 2 is chosen 
like drawin g 5 (b), "******** which is not", or utterance which assumed the actual operating 
condition like "*****♦**'' will be performed. That is, original voice can be registered by uttering 
the vocabulary of not only the vocabulary of the character title chosen as a title name but 
arbitration at the time of registration. 

[0033] The uttered voice data determines the start point of a word, and an ending point from an 
audio amplitude pattern etc., and starts a word while it is changed into the time series of an 
analysis parameter vector by voice analyses, such as band pass filter analysis. 
[0034] Next, a focus extract is performed in S305 and the focus is changed into a feature 
parameter. Here, the local peak of the obtained spectrum is detected and only these are binary- 
Hzation-extracted. Linearity or nonlinear time amount normalization processing is performed to 
the obtained feature parameter, and a voice pattern is generated. (S306) 
[0035] Next, the dependability judging of a voice pattern is performed in S307. Here, it judges 
whether the generated voice pattern has reached the level which deserves registering as a 
reference voice pattern. If judged with dependability being inadequate, a voice pattern is not 
registered, cannot be registered in S308, and it will output a control signal to a control section 
109 so that it may display the advice of the purport which needs voice reinput like shown in 
drawing 6 . (S309) It waits for a control section 109 to control the character signal creation 
circuit 113, and for a switch 1 18 to be again turned on by the monitor 107 in a predetermined 
time deed and S303 in this advice display. 

[0036] On the other hand, if judged with the dependability of a voice pattern being enough, a 
control signal will be outputted to a control section 109 in order to display the purport of the 
completion of registration, while making memory 1 1 1d memorize the title number corresponding 
to a voice pattern and it in S308, and registration actuation will be completed. 
[0037] Next, speech recognition mode processing is explained. Drawing 7 is a flow chart which 
shows actuation of speech recognition mode processing. If the switch 1 1 7 is set to b terminal 
side, it will go into speech recognition mode. If the switch 118 is turned on in S701, while 
performing detection and the voice analysis of a user's input voice in S702, the signal of the 
purport which forbids reception of actuation of other actuation keys is outputted to a control 
section 109. About a feature extraction S703 and the time amount normalization S704, the same 
actuation as S305 and S306 of above-mentioned drawi ng 3 is performed. 
[0038] Next, in S705, distance count with the input voice pattern vector and reference voice 
pattern vector of input voice and a reference voice pattern by which matching processing, i.e., 
time amount normalization, was carried out is performed. It is judged as a word the minimum 
thing in distance with each reference voice pattern calculated in S705 has been recognized to 
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be. (S706) 

[0039] Next, in S707, the degree of correlation with an input voice pattern and a reference voice 
pattern, i.e., the dependability judging of whenever [ speech recognition ], is performed. Here, it 
judges whether the distance of the input voice pattern and reference voice pattern which were 
calculated is smaller than a predetermined threshold like the above-mentioned. While outputting 
a control signal to a control section 109 in order to judge with there being no recognition 
dependability and to perform a display to that effect, if larger than a threshold, it waits to turn on 
a switch 118 again in S701. If smaller than a threshold, it will be judged with there being 
recognition dependability, and in S708, the title number of the recognized voice pattern is read 
from memory 1 1 1 d, and it outputs to a control section 1 09. 

[0040] Moreover, if a switch 1 1 7 is made into the position c of a termination to perform the title 
display by speech recognition, speech recognition actuation will be stopped and the display of a 
character title will not be performed, either. 

[0041] Drawing 8 is a flow chart which shows actuation of the control section 109 about a 
character display. In S801, a control section 109 judges whether the title number was inputted 
from the speech recognition circuit 1 1 1. If it judges that the title number was inputted, the 
character data corresponding to the title number inputted in S802 will be read from the 
character data memory 1 12, and the timer built in while outputting to the character signal 
creation circuit 1 1 3 will be started. 

[0042] In S804, when the display of a character title is continued and it exceeds 3 seconds 
based on the output of a timer while title display time did not exceed 3 seconds, a title display is 
stopped in S805 and it returns to the usual screen. In addition, although title display time was 
made into 3 seconds in this gestalt, it is not necessary to be especially 3 seconds, and if it is the 
time amount a title display is recognized to be, it is good without limit. 

[0043] Moreover, drawing 9 and drawing 10 show the example of a display of a character title. 
[0044] Thus, in this gestalt the image which displayed the character title by speech recognition, 
carried out multiplex [ of the character title ] to the picturized image, and displayed it on it is 
recordable. Therefore, the key stroke for a title display becomes unnecessary, and a user can 
express a character as the very easy approach of voice input. Therefore, a display of a character 
title is attained to desired timing, and the display also of two or more kinds of character titles is 
attained easily. 

[0045] Moreover, the input voice of arbitration can be registered corresponding to a character, 
and it can indicate by the character by uttering the input voice. Therefore, since a recognition 
system can be set as the specified speaker when a specified speaker performs voice input, a 
recognition rate becomes high. 

[0046] Furthermore, since the vocabulary of a character and the vocabulary of the input voice to 
register do not need to be the same, they can display a character in accordance with a user's 
operating condition. 

[0047] Drawing 1 1 is the block diagram showing the configuration of the digital VTR as second 
operation gestalt of this invention. The same number is attached about the same component as 
dj^wjngj , and the detailed explanation is omitted. 

[0048] With VTR of drawing 1 , the voice inputted by the microphone 104 has been recognized, 
and it generated and carried out multiplex [ of the character ] to the picturized video signal. On 
the other hand, speech recognition is performed to a playback sound signal, and a character title 
is expressed as this gestalt. That is, as shown in drawin g 1 1 , at the time of voice register mode, 
a switch 115 and a switch 1 16 are connected to r side edge child, the sound signal from the 
sound signal processing circuit 105 is inputted into the speech recognition circuit 111, and voice 
register mode processing is performed according to the flow chart of dr^ing_3 . 
[0049] Next, speech recognition mode processing is explained. If the switch 1 1 7 is connected to 
b terminal side, it will go into speech recognition mode. Moreover, at the time of speech 
recognition mode, it connects with p terminal side, respectively, and a switch 1 15,1 16 inputs the 
regenerative signal from the record regenerative circuit 106 into the speech processing circuit 
111. 

[0050] Drawing 1 2 is a flow chart which shows actuation of the speech recognition mode 
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processing in the second operation gestalt. Generally, since a user utters the voice for character 
titles in many cases and it is recorded in comparatively loud voice, it is louder than the usual 
voice level. For this reason, the voice for character titles can be distinguished with the inputted 
voice level. In S1201, it distinguishes whether voice input level is more than predetermined level 
in the regenerative signal from the record regenerative circuit 106. With [ the inputted voice 
level ] predetermined [ more than ], voice detection and a voice analysis are performed in S1202, 
and the following actuation of S1203-S1208 is the same as actuation of drawing 7 of S702-S707. 

[0051] Thus, in this gestalt, multiplex [ of the character title ] can be carried out to the image 
reproduced by speech recognition, and it can be displayed on it. Therefore, the key stroke for a 
title display becomes unnecessary, and a user can express a character as the very easy 
approach of voice input. Therefore, a display of a character title is attained to desired timing, and 
the display also of two or more kinds of character titles is attained easily. 

[0052] Moreover, the input voice of arbitration can be registered corresponding to a character, 
and it can indicate by the character by uttering the input voice. Therefore, since a recognition 
system can be set as the specified speaker when a specified speaker performs voice input, a 
recognition rate becomes high. 

[0053] Furthermore, since the vocabulary of a character and the vocabulary of the input voice to 
register do not need to be the same, they can display a character in accordance with a user's 
operating condition. 

[0054] In addition, although this gestalt explained the case where this invention was applied to 
video camera one apparatus VTR, besides this, this invention can be applied to a record 
regenerative apparatus with a character display function, and it has the same effectiveness. 
[0055] Moreover, although the alphabetic character character was displayed, you may make it 
express a graphic form etc. as the above-mentioned operation gestalt as a character besides an 
alphabetic character. Furthermore, the display position of a character can also be displayed on 
the location of not only the location shown in drawin g 9 and drawing 1 0 but arbitration, and can 
be changed also about the method of presentation. 
[0056] 

[Effect of the Invention] Since it is the configuration in which the character title display with 
voice is possible according to this invention as explained above, two or more character displays 
are enabled easily, and a character title can be expressed on a screen as more effective timing. 
[0057] Moreover, since it has the composition of registering input voice corresponding to a 
character, the rate of speech recognition of a specified speaker becomes high. 
[0058] Furthermore, since it has the composition that the vocabulary of a character may differ 
from the vocabulary of the input voice to register, a character can be displayed in accordance 
with a user's operating condition. 
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* NOTICES * 

JPO and NCIPI are not responsible for any 
damages caused by the use of this translation. 

1 This document has been translated by computer. So the translation may not reflect the original 
precisely. 

2.**** shows the word which can not be translated. 
3.1n the drawings, any words are not translated. 



DESCRIPTION OF DRAWINGS 
[Brief Description of the Drawings] 

[QLr^in&l] It is drawing showing the configuration of video camera one apparatus VTR to which 
this invention is applied as first operation gestalt. 

[Drawing 2] It is the flow chart of the speech recognition processing which the microprocessor 
°f drawing 1 performs. 

[Drawing 3] It is the flow chart which shows actuation of voice register mode processing of 
drawing 2 . 

[Drawing 4] It is drawing showing the example of a monitor display at the time of character 
setting-out mode. 

[Drawing 5] It is drawing showing the example of a display of a character title list screen. 
[ Drawin g 6] It is drawing showing the example of a display of the voice reinput advice display at 
the time of being judged with it being inadequate in dependability. 

[Drawing 7] It is the flow chart which shows actuation of speech recognition mode processing of 
drawing 2 . 

[Drawing 8] It is the flow chart which shows the actuation about the character display which the 
control section of drawing 1 performs. 

[ Drawin g 9] It is drawing showing the example of a display of a character title. 
[Drawin g 10] It is drawing showing the example of a display of a character title. 
[ Drawin g 1 1] It is drawing showing the configuration of video camera one apparatus VTR to 
which this invention is applied as second operation gestalt. 

[Drawing 12] It is the flow chart which shows actuation of the speech recognition mode 
processing in the second operation gestalt. 

[Translation done.] 
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